Gunky Kim

ثبت نشده
چکیده

Probabilistic data linkage is an attractive data collection option when direct measurement is impossible or extremely costly. Furthermore, the cost of linking is typically small compared to the benefit that can be derived from analysis of the linked data. One important linkage application is where different data sets relating to the same individuals at different points in time are linked to provide a longitudinal data record for each individual, thus permitting longitudinal analysis for these individuals. However, without access to the same unique identifier in each of the linked data sets, there is always the possibility that linkage errors in the merged data could lead to a longitudinal record ostensibly relating to a single individual being actually made up of a composite of data items from different individuals. This in turn could lead to bias and loss of efficiency for the longitudinal modelling process. In this talk I will present recent results for unbiased regression inference using longitudinally linked data in the presence of probabilistic linkage errors. The approach is based on that described in Chambers (2009) and I will focus on the situation where the linked data are obtained by independently linking three data sources. Two scenarios will be investigated in detail. In the first, all three data sources represent different registers for the same population. In the second, I will extend this theory to where these data sources are a mix of register data and sample survey data.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Gunky Objects in a Simple World

Suppose that a material object is gunky: all of its parts are located in space, and each of its parts has a proper part. Does it follow from this hypothesis that the space in which that object resides must itself be gunky? I argue that it does not. There is room for gunky objects in a space that decomposes without remainder into mereological simples.

متن کامل

Semiparametric estimation of the dependence parameter of the error terms in multivariate regression

A semiparametric method is developed for estimating the dependence parameter and the joint distribution of the error term in the multivariate linear regression model. The nonpara-metric part of the method treats the marginal distributions of the error term as unknown, and estimates them by suitable empirical distribution functions. Then a pseudolikelihood is maximized to estimate the dependence...

متن کامل

Maximum Likelihood Logistic Regression with Auxiliary Information for Probabilistically Linked Data

Despite the huge potential benefits, any analysis of probabilistically linked data cannot avoid the problem of linkage errors. These errors occur when probabilitybased methods are used to link or match records from two or more distinct data sets corresponding to the same target population, and they can lead to biased analytical decisions when they are ignored. Previous studies aimed at resolvin...

متن کامل

Comparative assessment of manual load lifting tasks in three methods: KIM-LHC, WISHA and SNOOK tables: Case study in printing industry

Background: today, musculoskeletal disorders are considered as a serious threat for workers, and one of the main activities, which leads to these problems, is lifting tasks. Moreover, workers in printing industry are widely involved in inappropriate postures and lifting loads. The aim of this study was to compare the results of assessment of lifting tasks with KIM-LHC, WISHA and Snook tables in...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010